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Formal Languages and Infinite Groups 

ROBERT H. OILMAN 

Abstract. This article is an introduction to formal languages from the 
point of view of combinatorial group theory. 
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1. Introduction 

A formal language is a subset of a finitely generated free monoid. Certain 
classes of formal languages play important roles in computer science and are 
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beginning to have interesting applications to combinatorial group theory. Auto- 
matic groups for example are denned in terms of rational languages and rational 
relations. Rational languages also occur in the study of word hyperbolic groups 
and are useful in computational group theory. The larger class of context-free 
languages can be used to characterize virtually free and virtually cyclic groups. 
The fundamental groups of compact geometrizable three-manifolds are describ- 
able in terms of indexed languages. 

This expository article is an introduction to formal language theory for people 
whose interests include combinatorial group theory. We concentrate on the basic 
properties of rational and context-free languages because these are the languages 
which have had most application to group theory so far. Rational languages are 
easily described; they are the closure of the finite subsets of finitely generated free 
monoids under union, product and generation of submonoids. Rational subsets 
of arbitrary monoids are defined in the same way, but they do not possess all 
the properties of rational languages. 

The rational subsets of a monoid M are precisely the subsets accepted by 
finite automata over M. A finite automaton T over M is a finite directed graph 
with a distinguished initial vertex, some distinguished terminal vertices, and 
with edges labelled by elements from M. The set accepted by T is the collection 
of labels of paths from the initial vertex to a terminal vertex. Figure 1 shows a 
finite automaton over E*, the free monoid over a finite set E containing a and b. 
The symbol e stands for the unit element of E*. In this figure and in all others 
an arrow with no source indicates the initial vertex, and arrows with no target 
identify terminal vertices. This finite automaton accepts the rational language 
{tfV \i>0,j> 0}. 





Figure 1. A finite automaton over E*. 

There are also acceptors, called pushdown automata, for context-free lan- 
guages. Since the standard definition of pushdown automaton although useful 
for understanding aspects of programming is unwieldy, we do not use it. For us 
a pushdown automaton accepting a context-free language in E* is just a finite 
automaton over M c f x E* where M c f is a certain monoid characterizing context- 
free languages. A word to e E* is accepted by such a pushdown automaton if 
there is a path from initial vertex to terminal vertex with label (l,w). This 
approach follows that of several authors. See Section 10 for references. 

Figure 2 shows a pushdown automaton. Pd and Qd are elements of M c f, and as 
before a, b G E. This finite automaton accepts the rational subset {(-P^Q^, a, l b>) 
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(Pd, a) {Q d , b) 




Figure 2. A finite automaton over M c f x E*. 

i > 0, j > 0} of M c / x S*. Construed as a pushdown automaton it accepts the 
set of all words a % b> such that P\Q 3 d = 1 in M c /. Given that P<jQd = 1 and Pd 
and Qd have infinite order, we see that this pushdown automaton accepts the 
language {a n b n n > 0}. 

Replacing M c f by different monoids affords a uniform way of defining au- 
tomata accepting other classes of languages. More precisely we specify M and a 
subset X C M, and consider automata over MxS* which accept w € E* if for 
some x G A there is a path with label (x, u>) from initial vertex to terminal ver- 
tex. Theorem 6.2 characterizes the classes which can be defined this way; they 
fall between full trios and full abstract families of languages. Stack automata 
are considered briefly from this point of view in Section 8. 

We make the following conventions. E* denotes the free monoid over a set 
E. E may be referred to as an alphabet and elements w £ E*, as words. The 
unit element of S* is e, the empty word; and \w\ is the length of the word w. A 
language contained in E* is also called a language over S. E and A will always 
denote nonempty finite sets. In order to avoid some awkward constructions we 
write function and relation symbols on the right. 

2. Rational Sets 

Definition 2.1. The rational subsets of a monoid M are the closure of its 
finite subsets under union, product, and monoid closure. 

We call union, product and monoid closure the rational operations and write 
them as S1US2, S1S2, and respectively. Notice that this notation is consistent 
that for free monoid adopted above. Rational subsets of free monoids are called 
rational languages and also regular languages. 

Theorem 2.2. Let M and M' be monoids and a : M — > M' a homomor- 
phism. 

(i) For any set of generators A of M, each rational subset of M lies the 
submonoid generated by a finite subset of A. 

(ii) The rational subsets of M are closed under rational operations. 

(iii) The image of a rational subset of M under a is a rational subset of M' ; 
and if a is onto, every rational subset of M' is obtained this way. 

PROOF. We prove the first assertion by induction on rational sets. Let S be a 
rational subset of M. If S is finite, then clearly it lies the submonoid generated by 
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a finite subset of A. Suppose S = S1US2 and each Si lies a submonoid generated 
by a finite subset of A; then S is contained in the submonoid generated by the 
union of the two finite sets. The same argument works if S = S1S2 or S = S^, 
and we conclude that (i) holds. 

Part (ii) follows immediately from Definition 2.1, and the last part is proved 
by induction on rational sets. □ 

Finite automata afford a convenient way of describing individual rational sets. 
Definition 2.3. A finite automaton defined over a monoid M is a finite di- 
rected graph with 

(i) Edges labelled by elements of M; 

(ii) An initial vertex; 

(iii) A set of terminal vertices. 

A finite automaton over a subset icMis just a finite automaton over M whose 
edge labels are all in A. 

Since finite automata play such a large role in this exposition, we will refer 
to them as automata; that is, from now on automaton means finite automaton. 
By a path in an automaton we mean a directed path. The label of a path is the 
product in order of the labels of its edges; in particular the label of an empty 
path, i.e., a path of length zero from a vertex to itself, is the unit element. 

Definition 2.4. The set accepted by an automaton is the collection of labels 
of successful paths. A successful path is one from the initial vertex to a terminal 
vertex. 

Figure 1 shows a finite automaton which accepts the rational language a* b*. 

Lemma 2.5. Let A be a subset of a monoid M. Every set accepted by an 
automaton over A is accepted by one such that 

(i) The initial vertex has no inedges; 

(ii) There a single terminal vertex, and it has no outedges. 

Proof. Since the set accepted is not changed by the addition of a new isolated 
terminal vertex, we may assume the set of terminal vertices is not empty. If the 
initial vertex p has inedges, add a new initial vertex p' and duplicate only the 
outedges of po ■ For each terminal vertex p with outedges add a new terminal 
vertex p' . Duplicate the inedges of p, and remove it from the set of terminal 
vertices. Changing the first and last edges of a path gives a correspondence 
between paths from po to p and those from p' to p' . Thus the set of labels 
of successful paths is not altered by the changes made so far. Since terminal 
vertices now have no outedges, identifying all terminal vertices does not affect 
the set of labels of successful paths either. □ 

Theorem 2.6. Let A be any set of generators for the monoid M. A subset 
of M is rational if and only if it is accepted by an automaton over A. 
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Proof. Argue by induction on rational sets to show that every rational subset 
of M is accepted by an automaton. Indeed every finite subset of M is accepted 
by an automaton constructed from a tree, and it is not hard to combine automata 
satisfying the conditions of Lemma 2.5 to accept unions, products, and monoid 
closures. If T accepts S and T' accepts S', identify the two initial vertices to 
obtain an automaton accepting S U S'. Likewise identify the terminal vertex of 
T with the initial vertex of V to accept SS', and the initial and terminal vertices 
of T to accept S* . 

For the converse let F accept S and argue by induction on the number of 
edges of F. If there are no edges, then S equals {e} or <f> depending on whether 
or not the initial vertex is terminal. Otherwise construct an automaton r from 
r by removing an edge e while retaining its source and target vertices p and p' 
respectively. Let a be the label of e. The label w of any successful path in T not 
lying entirely in r is a product 

(1) w = woawia ■ ■ ■ w m , m>l 

for which the following conditions hold. The word Wo is the label of a path in 
r from po to p, and each subsequent Wi except the last is the label of a path in 
r from p' to p. Finally w m is the label of a path in r from p 1 to a terminal 
vertex of T. Construct automata T i7 1 < i < 3 from r by altering the initial 
and terminal vertices of r in the following way. 

(i) Ti has initial vertex p and terminal vertex p; 

(ii) F2 has initial vertex p and terminal vertex p'; 

(iii) F3 has initial vertex p' and the same terminal vertices as F. 

For < i < 3 let Si be the set accepted by These sets are rational by 
induction, and Equation 1 implies 

S = SoUS ia (S 2 a)*S 3 . 

Thus S is rational. □ 

3. Rational Languages 

Rational languages satisfy some properties not satisfied by rational sets in 
general. 

By Theorem 2.6 we take automata over free monoids as defined over the free 
generators unless otherwise stated. 

Theorem 3.1. For any finite alphabet E 

(i) The rational languages over E are the inverse images of subsets of finite 
monoids F under homomorphisms a : E* — > F; and 

(ii) The rational languages over E are closed under Boolean operations. 

Proof. The second assertion follows immediately from the first. Let L be a 
rational language contained in E* and assume L is accepted by a finite automaton 
T over E. For each w £ E* define a binary relation ^ w on the vertices of T by 
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P ~«j p' if and only if there is a path from p to p' with label w. Because S* is free, 
~ w „ equals the composite ~ w o <~„. Hence the map w — is a homomorphism 
from X* to the finite monoid of binary relations on the vertices of T. L is the 
inverse image of the set of all relations ~ such that po ~ p for some terminal 
vertex p. 

Conversely suppose that a : X* — ► F is a homomorphism to a finite monoid 
F, and L = Xa^ 1 for some X C F. Let T be directed graph with vertices F 
and labelled edges x A y for all a;, y G F and aeE such that x(acr) = y. Make 
r into an automaton with initial vertex 1 and terminal vertices X. Since a path 
from 1 with label w ends at wa, T accepts L. □ 

The automaton constructed in the last part of the proof above has a spe- 
cial property. Every vertex has exactly one outedge labelled by each generator. 
Such automata are called deterministic; the others are nondeterministic. For 
automata over monoids which are not free there are other notions of determin- 
istic. It is a fact that each rational language is accepted by a unique smallest 
deterministic automaton, called the minimal automaton of the language. 

Corollary 3.2. Every rational language is accepted by a deterministic au- 
tomaton. 

Theorem 3.3. For every rational language L there is an integer n with the 
property that if w G L and \w\ > n, then w can be written as a product xyz such 
that xy*z C L and < \y\ < n. 

Proof. Let L be accepted by an automaton over S with n vertices. Every 
successful path of length greater than n must have a loop of length strictly 
between and n + 1. □ 

Theorem 3.3 is called the Pumping Lemma for Rational Languages; its usual 
first application is to show that the language {a n b n } is not rational. 

4. Applications of Rational Languages 

Let G be a group. A choice of generators for G is a surjective monoid ho- 
momorphism <t : S* — > G. We will write w for the image, wa, of w G S*. By 
Theorem 2.2 every rational subset of G lifts to a rational language over S. Just 
as group elements are represented by words, rational subsets of groups are repre- 
sented by rational languages. We will say that a choice of generators a : S* — > G 
has formal inverses if S is a union of pairs {a, a -1 } and a _1 cr = (aer) -1 . We 
emphasize that E still generates S* freely as a monoid; there is no cancellation 
in S*. 

The word problem of G is the language W(G) = {w \ w = 1}. W{G) depends 
on the choice of generators, but the dependence is mild. A precise statement is 
given in Theorem 6.4. The following theorem characterizes the word problem of 
finite groups with respect to any choice of generators. 
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Theorem 4.1. The word problem of a finitely generated group G is a rational 
language if and only if G is finite. 

Proof. Suppose £ — > G is a choice of generators. If G is finite, then W(G) 
is rational by Theorem 3.1. For the converse suppose W(G) is accepted by an 
automaton T. We will show that the order of G is at most equal to the number 
of vertices in T. 

Assume without loss of generality that every vertex of T is on a successful 
path. Indeed discarding all edges and vertices not on successful paths does not 
affect the language accepted by T. For any word w e S* there is a word v 
representing the inverse of w. Hence wv is accepted by T, and consequently w 
is the label of a path starting at the initial vertex po of T. In particular every 
element of G is represented by the label of such a path. To complete the proof it 
suffices to show that if w and u are labels of path from p to the same vertex p, 
then w — u. By our assumption there is a path from p to a terminal vertex. Let 
the label of this path be v. We have that wv and uv both represent the identity 
in G whence w = u. . □ 

Theorem 4.2. The subgroup generated by a rational subset of a group is 
finitely generated. 

Proof. Let (S) be the subgroup generated by S C G. Suppose S is rational 
and is accepted by an automaton T over G. If S = <p, then (S) = {1}, so we 
may assume S ^ (j). By Lemma 2.5 we may suppose that T has a single terminal 
vertex p t ; and as above we may discard all vertices and edges not on successful 
paths. Since T accepts a nonempty set, pt is not discarded. It follows from our 
conditions that we can pick a spanning subtree Tq of T with root po, the initial 
vertex of T, and with all edges directed away from the root. The label of any 
successful path in T is a product 

9 = 9ohi9ih 2 g2 ■ ■ ■ 9m-ih m g m . 

in which each gi is the label of a path in To, and each hi is the label of an edge 
ei not in Tq. Let Xi and yi be the labels of the paths in Tq from the root to the 
source and target vertex of respectively, and let z be the label of the path in 
T from po to p t . Since there is at most one path in r between any two vertices, 
x\ = g - Likewise x i+1 = yigi for 1 < i < m, and z = y m g m - Thus 

g = x 1 h 1 y^ 1 x 2 h 2 y2 1 ■ ■ ■ x m h m y^z 

It follows that S C H = (z,xhy~ l ) , the subgroup generated by z and xhy^ 1 
where h ranges over all labels of edges e not in To, and x and y are the labels of 
the paths in r to the source and target of e respectively. To complete the proof 
it suffices to show that the generators for H lie in (S). We have z & S because z 
is the label of a successful path. Suppose e is an edge not in To, and xhy~ l is its 
corresponding generator. Let v be the label of a path in T from the target vertex 
of e to p t . It follows from xhv, yv <E S that xhy^ 1 — (xhv)(yv)^ 1 E (S). □ 
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Next we use rational languages to prove that free groups have the Howson 
property. 

Theorem 4.3. The intersection of two finitely generated subgroups of a free 
group is finitely generated. 

Proof. Let G be a finitely generated free group and S^Ga choice of free 
generators with formal inverses. By Theorem 2.2 we may assume that for i = 1, 2 
the finitely generated subgroup Hi is the image of the rational language Li over 
X. If each Li contains all the freely reduced words which represent elements 
of Hi, then L\ n L 2 contains all freely reduced words projecting to H\ n H 2 . 
Consequently L\ P\ L 2 projects onto H\ l~l H 2 , and Hi n H 2 is finitely generated 
by Theorem 4.2. Thus the following lemma completes the proof. □ 

Lemma 4.4. 7/E = {a, a^ 1 . . .} and L cS* is rational, then the correspond- 
ing language of freely reduced words is rational too. 

Proof. Let L be an automaton defined over X and accepting L, and let Lo be 
the corresponding set of freely reduced words. We will make T into an automaton 
over X U {e}. If for vertices p and q, T has a path from p to vertex q whose label 
freely reduces to the identity, add an edge from p to q with label e unless such 
an edge is not already present. This effect of this addition is to add to L some 
free reductions of words already there. Continue the edge addition procedure as 
long as possible. Since T is finite, the process must terminate; and it is easy to 
see that the language V accepted by the modified automaton is the closure of 
L under free reduction. As Lq is the intersection of L' with the rational set of 
freely reduced words, L is rational. □ 

5. Rational Relations 

In addition to the word problem of G introdyced in Section 4 one can consider 
binary relations induced by the group operation. Let X* — > G be a choice of 
generators, and for each g £ G define a binary relation p g on X* by wp g v if 
wg = v. What can we say about G if p g is a rational subset of X* x X*? For 
any w G X*, wp g is rational by Corollary 5.5. Since wp g is the word problem of 
G if w = <7 -1 , Theorem 4.1 implies that G is finite. 

Suppose that instead of demanding that each p g be rational, we require only 
that the restrictions r g of p g to some fixed rational language L projecting onto 
G be rational. This line of thought leads to the definition of automatic groups. 
By Lemma 5.2 r s -i = t^ 1 is rational if r g is, and it follows from Theorem 5.3 
that if T g and Th are rational, T g u is too. Thus it is sufficient to verify rationality 
of T g for g in a set of generators of G. If we require that for each g in a finite set 
of generators for G, r g is not only rational but is accepted by a special kind of 
finite automaton over X* x X*, then G is automatic. See [7] for details. 
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Definition 5.1. Let M and M' be monoids. A relation p : M — > M' is 
rational if its graph is a rational subset of M x M' . A relation between finitely 
generated free monoids is called a transduction. 

From now on we will not distinguish between a relation and its graph. For 
any relation p : M — > M', mp — {to/ \ mpm'}; and for S C M, Sp is defined 
similarly. If mp = {to'}, a singleton, we write also mp — ml . 

Lemma 5.2. Let M and M' be monoids. 

(i) If p : M — > M' , then the relation p -1 : M' — > M defined by mpm' if and 
only if m'p'm is rational if p is. 

(ii) I/M = M' and pl = {(to, to) | to G L} /or a rational subset L C M, 
i/ierc is rational. 

(iii) 7/L and L' are rational subsets of M and M' respectively, then p = LxL' 
is rational. 

(iv) If p : M — > M' is a partial homomorphism whose domain is a finitely 
generated submonoid of M , then p is rational. 

Proof. To prove the first assertion note that p and p' correspond under the 
obvious isomorphism M x M' — > M' x M, and apply Theorem 2.2. Part (ii) 
is proved similarly. For part (iii) use the canonical injections M — > M x M' 
and M' — > M x M' together with Theorem 2.2 to conclude that L x {1} and 
{1} x Z/ are rational subsets of M x M' . Hence L x V = (L x {1})({1} x L') 
is rational too. Finally if {mi, . . . ,TOfc} generates the domain of p : M — > M', 
then p = {(mi,TOip), . . . , (m k ,m k p)}*. □ 

Theorem 5.3. J/p : M — > E* and r : S* — > M' are rational relations, so is 
por. 

Proof. Pick sets of generators A and A' for M and M' with both sets of 
generators containing the identity. Let Si = S U {e}. By Theorem 2.6 p is 
accepted by an automaton T p over A x Si and t is accepted by T T over Si x A' . 
For each vertex p of T p add an edge from p to p with label (1, e) if such an edge 
is not already present; T p still accepts p. Make the corresponding changes to T T . 
We will see that the automaton T over Ax A' defined as follows accepts por. 

(i) The vertices of Y are ordered pairs (p, q) of vertices from Y p and T T 
respectively; 

(ii) There is an edge from (p, q) to (jp' , q') with label (a, c) if and only if for 
some b e Si there are edges from p to q with label (a, b) and from p' to 
q' with label (6, c) in T p and Y r respectively. 

(iii) The initial vertex is (po,Qo) where po and qo are the initial vertices of 
Y p and Y T respectively; 

(iv) The set of terminal vertices is the product of the corresponding sets for 
Y p and r r . 

This construction is illustrated in Figure 3. There we have labelled some single 
edges with two labels rather than draw two edges with one label each. It is 
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(a,e),(e,e) (e, e) (e, e) (e,e) 




r p accepts p = {(a n+1 ,b)}. T T accepts r = {(b n , c 2 ™)}. 



(a,e),(e,e) (a,e),(e,e) (e, e) (e, e) 




t I 

T accepts por = {(a n+1 , c 2 )}. 

FIGURE 3. Construction of an automaton accepting the com- 
posite of two rational relations. 

straightforward to show by induction on path length that there is a path in T 
from (p, q) to (p' , q') with label (u, w) if and only if the following conditions hold. 

(i) For some word ueS* there is a path in Y p from p to p' with label (u, v) 
and a path in T T from g to q' with label (w, w). 

(ii) In both path labels v is expressed as a product of elements of Ei in 
exactly the same way. 

It follows immediately that if T accepts (u, v), then u(p o t)v. For the converse 
suppose upv and vtw. There are successful paths in T p and T r whose labels 
are (u, v) and (w, w) respectively. In the label of each path the word v occurs as 
a product of a sequence of elements of Si but the two sequences can differ by 
insertion and deletion of e's. Using the extra edges we have added, the successful 
paths can be adjusted to make the two sequences identical. □ 

Corollary 5.4. Rational transductions are closed under composition. 

Corollary 5.5. If p : £* — > M is a rational relation and L C X* is rational, 
then Lp is rational too. 

Proof. Lp is the projection of pl ° p from X* x M to M. Apply Lemma 5.2 
and Theorem 2.2. □ 
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6. Families of Languages 

In this section we use rational relations to define collections of languages 
determined by a monoid M and a distinguished subset X. For lack of a better 
word we call these collections families. In the next two sections we see how 
particular choices of M and X yield context-free languages and stack-automaton 
languages. 

Definition 6.1. The family of languages T{M, X) determined by a monoid 
M and a nonempty subset X C M is the collection of images Xp as p ranges 
over all rational relations from M to finitely generated free monoids. 

If M = {1}, then rational relations from M to E* are sets of the form {l}xl 
where L is any rational language over S. It follows that ,F({1},{1}) is the 
collection of all rational languages. 

By Theorem 2.6 L C X* is in T(M, X) if and only if there is an automaton 
r over M x S* such that L is the collection of all w g S* with (to, w) the 
label of a successful path in T for some m g A". We will say in this case that 
r accepts u>. We have two notions of acceptance. T accepts both a rational 
relation p C M x S* and the language L = Xp C £*. 

We see that the languages in !F(M, X) are characterized concretely in terms 
of automata at least if M and X are themselves sufficiently tangible. A more 
abstract characterization is given in the next theorem. When we speak of the 
image of a language L C X* under a transduction, we refer to a transduction 
of the ambient free monoid E*. A class of languages is closed under rational 
transduction if whenever L is a language in that class, then the image of L 
under rational transduction is in the class too. Likewise a class is closed under 
union if whenever languages L and L' defined over the same alphabet arc in the 
class, so is L U L'. 

Theorem 6.2. Let T = F{M,X) be a family of languages. T contains all 
rational languages and is closed under rational transduction and union. Con- 
versely every collection of languages which contains a nonempty language and is 
closed under rational transduction and union is a family of languages. 

Proof. Pick x g X. Every rational language L over E is the image Xp under 
the transduction p = {x} x L. By Lemma 5.2 p is rational, and so L 6 T . 

To show that T is closed under union we must show that if L and V are 
languages over S in J 7 , then so is LUL'. We have L = Xp and V = Xp' for some 
rational relations p : M -> S* and p' : M -> E*. It follows that LUL' = X(pUp'): 
and since the union of rational relations is a rational relation, we are done. 
Likewise if r : E* — > A* is a rational transduction, then Lt = X(p o r) g T by 
Theorem 5.3. 

For the last part of the theorem let C be a collection of languages containing 
a nonempty language and closed under transduction and union. Observe that 
if we alter a language by substituting new letters for its alphabet, then each 
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version of the language is the image of the other under a homomorphism. Since 
these homomorphisms are rational transductions by Lemma 5.2, we may pick 
a subcollection Co of languages defined over disjoint alphabets such that every 
language in C is the image under homomorphism of some language in Co- 

Now let M be the free monoid over the union of the alphabets of languages 
in Co and define X to be the corresponding union of languages. Since C con- 
tains a nonempty language, X ^ <p. Take T = !F{M,X). Every language in 
Co is the image of X under the partial identity homomorphism whose domain 
is the free submonoid of M generated by the alphabet corresponding to that 
language. Hence every language in C is the image of X under a partial homomor- 
phism whose domain is a finitely generated free submonoid of M. Consequently 
Lemma 5.2 implies C C T . 

It remains to show that T C C. Suppose L = Xp for some rational relation 
p : M — > £*. By Theorem 2.2 p C M x S* where M is the free monoid over 
the union of a finite number of the alphabets used to define M . Consequently 
L = (X (1 Mo) p. By definition of X, X n Mo is a finite union of languages in Co- 
As C is closed under union, X n M G C; and as C is closed under transduction, 

(X n Mo)p e C. □ 

The closure properties of the preceding theorem resemble some well-known 
combinations from computer science. If we drop closure under union, we have 
a full trio; if we add closure under all rational operations, then we obtain the 
definition of a full abstract family of languages. 

COROLLARY 6.3. Every family of languages is closed under intersection with 
rational sets. 

Proof. If L and R are languages over S, then LPiR = Lpn where pn : S* — > 
S* is defined by pn = {(w, w) | w G R}. If R is rational, then by Lemma 5.2 pa 
is a rational transduction. □ 

The next theorem shows that it makes sense to speak of the word problem of 
a group as being in a particular family. 

Theorem 6.4. Let J- be a family of languages. If the word problem for G 
with respect to one choice of generators is in T , then the word problem with 
respect to any choice of generators is also. Further the class of groups whose 
word problems lie in T is closed under isomorphism, finitely generated subgroup 
and finite extension. 

Proof. Suppose a : S* — > G is a choice of generators for G and p, : A* — > H 
is one for the subgroup H C G. Let Lq = lcr -1 and Lh = There is a 

monoid homomorphism / : A* — > S* such that p = f o a, and consequently 
Lh = Lof~ x . It follows that Lq G T implies Lh G T. Taking H = G, we 
see that whether or not Lq G T depends only on G and not on the choice of 
generators. 



FORMAL LANGUAGES AND INFINITE GROUPS 



13 



Clearly the class of groups whose word problems lie in T is closed under 
isomorphism. Thus it remains only to prove the last assertion. At this point we 
are free to make any convenient choices of generators for G and H. Choose a : 
S* — > G to be a choice of generators with formal inverses. Let Y be the Schreier 
diagram of H in G, i.e., the directed graph whose vertices are the right cosets 
of H in G and with edges Hg A Hg' whenever a G S and Hga = Hg'. Taking 
H to be the initial and single terminal vertex of Y, we obtain a deterministic 
automaton which accepts the language of all words in S* representing elements 
of H. 

Y admits a spanning tree Yq rooted at H and with all edges directed away 
from the root. As in the proof of Theorem 4.2 the edges of Y not in To correspond 
to generators of H. More precisely for each edge e not in Yq let w be the label of 
the path in Y from H to the source of e and v the path to the target of e. If e 
has label a, then G H, and the collection of these elements as e ranges 

over all edges not in Yq generates H. 

By hypothesis Lh G T, and we may assume that we have a choice of genera- 
tors n : A* — ► H such that A maps bijectively to the set of generators obtained 
in the preceding paragraph. That is, the elements of A correspond to the edges 
in r — IV Alter the edge labels of Y as follows. Replace each label a of an edge 
in F by (a, e), and replace each label a of an edge in Y — Y by (a, d) where d is 
the element of D corresponding to that edge. Yq now accepts a rational trans- 
duction p : S* — > A* which rewrites each word in E* representing an element of 
H as a word in A* representing the same element. In fact since Y was originally 
deterministic, p is a partial function. It follows immediately that w G Lq if and 
only if wp is nonempty and wp G Lh- Thus Lq = Lnp^ 1 G T . □ 

We have seen that groups with rational word problem are finite, and Mullcr 
and Schupp [21] have shown that groups with context-free word problem are 
finite extensions of finitely generated free groups. On the other hand many 
interesting classes of groups are not closed under taking finitely generated sub- 
groups. If we want to study word problems for these classes, we will need to 
weaken our notion of family. 

7. Context-Free Languages 

In this section we introduce context-free languages. Fix an infinite countable 
set fl, and let fl* be the free monoid over Q. For each word w G £1* define partial 
functions P w and Q w from 0* to itself by 

uP w = uw for all lie!!* uQ w = v if u = vw. 

We math think of P w as pushing w onto a stack and Q w as popping w off a 
stack. Let M c f be the monoid of partial functions generated by the P's and Q's. 
M c f has unit element 1 = P e — Q t and zero element (the empty function). 
The properties listed below are easily verified. 
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Lemma 7.1. P W P V = P wv and Q W Q V = Q V w- Further P W Q V = unless one 
of the words w,v is a suffix on the other. Every element of M c f except can 
be written uniquely as Q W P V . A product of two of these normal forms equals 
except in the following cases. 

wPzxQxPy = QwPzy 
QwPvQzvPy Qzw-Py 

Definition 7.2. T cf = T{M c} , {1}) is the family of context-free languages. 

Definition 7.3. A pushdown automaton is a finite automaton over M c f x S* 
where S* may be any finitely generated free monoid. If p : M c f — > E* is the 
rational relation accepted by a pushdown automaton, we say that the automaton 
accepts the context-free language lp. 

Let us compare Definition 7.3 with the usual definition of a pushdown au- 
tomaton. 



a\ Q2 as 04 as 



a 7 



q 








di 




d 2 








d 3 





Figure 4. A standard pushdown automaton V . 

Figure 4 shows a standard pushdown automaton V . V has a read-only input 
tape, a pushdown stack, and a finite number of internal states q. In Figure 4 
the input tape is horizontal and the stack is vertical. At any instant V is in 
state q and is scanning a square on its input tape. Based on q, the contents of 
the square being scanned, and the top element of the stack, V makes a move. 
During a move V may pop, i.e., remove, the top element of its stack, push a 
finite string of symbols onto the stack, move on to the next square of the input 
tape, and enter a new internal state. It need not do all these things during each 
move. When a symbol is popped from the stack, the symbol below it becomes 
the new top of the stack. At different points is its computation V may have just 
one possible move, a choice of moves, or no moves at all. In particular V cannot 
move if its stack is empty. 

At the beginning of a computation the input word is written left justified on 
the input tape, and the stack is empty except for a special end of stack symbol. 
V is placed in a designated initial state and set to scan the first square of its 
input tape. If there is a sequence of moves which ends with empty stack and V 
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in one of a number of distinguished final states, and if during this sequence V 
reads all its input, then V accepts the input word. There may be other sequences 
of moves which end differently or not at all, but that does not matter. 

Figure 2 shows a pushdown automaton in our sense; call it Q. Let us view Q 
as a standard pushdown automaton and see how it accepts {a n b n }. Q starts by 
pushing one d onto the stack for each a it sees in the input. For each succeeding 
b in the input, Q tries to pop a d from the stack. If the number of b's equals the 
number of a's, Q accepts by emptying the stack and entering its final state. If 
there are too many b's, Q empties the stack before reading all its input; and if 
there are too few, Q does not empty its stack. 

Here is a more formal and shorter demonstration. 

Theorem 7.4. The pushdown automaton of Figure 2 accepts L = {a n b n \ 
n > 0}. 

Proof. It is immediate from Figure 2 that the labels of paths from initial 
vertex to terminal vertex are {(P^Q^, o'b 3 ')}. By Lemma 7.1 P^Q J d = 1 if and 
only if i = j . □ 

In the remainder of this section we present some standard facts about context- 
free languages; but before starting we want to mention a variation on our original 
definition of pushdown automaton. According to Definition 7.3 a pushdown 
automaton is an automaton over M c f x £*, but by Theorem 2.6 we may restrict 
ourselves to automata over 



These automata proceed at a sedate pace, reading at most one input letter 
and performing at most one stack operation per move. Compared to them our 
original pushdown automata might be called accelerated. 
Now here are the results promised above. 

Theorem 7.5. T c f is a full abstract family of languages. 

Proof. By Theorem 6.2 it suffices to show that if L and L' are context-free 
languages over S, then so are LL' and L* . Suppose T and I" are pushdown 
automata accepting L and V respectively. Let d\ , . . . , d n be the elements of f2 
which occur in labels of T and T', and pick e 6 ft distinct from the di's. Alter 
T as indicated in Figure 5 to create a pushdown automaton IV More precisely 



{{Pd,Qd |rfeO}u{i}) x (SU{e}). 



r 



(Qe,e) 



Figure 5. Construction of the pushdown automaton IV 
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add a new initial vertex joined to the old one by an edge labelled (P e ,e), and 
join all old terminal vertices to a single new terminal vertex by edges labelled 
(Q e ,e). Clearly Ti accepts (P e Q v P w Q e ,u) if and only if T accepts {Q v P Wl u). It 
follows from Lemma 7.1 and our choice of e that Ti accepts (1, u) for all u G L 
and that everything else accepted has the form (0, y) for some y G £*. Modify T' 
similarly to obtain which accepts (1, u) for all uel' and possibly (0, y) with 
y G S* — L' . These pushdown automata may be used as indicated in Figure 6 
to construct pushdown automata accepting LL' and L* . □ 



(M) 




FIGURE 6. Construction of pushdown automata accepting LL' 
and L*. 

Our first impulse in proving Theorem 7.5 is to connect the unmodified au- 
tomata r and r' the way Ti and r[ are connected in Figure 6. However, it might 
be that T accepts (P w ,u) and T' accepts (Q w ,v). In this case F followed by V 
would accept uv not necessarily in LL' . Thus modifications are necessary. 

The following theorem is a pumping lemma for context-free languages. Its 
usual first application is to show that the language {a n b n c n } is not context-free. 

Theorem 7.6. For every context-free language L there is an integer k such 
that any word in L of length greater than k can be written as uvwxy so that 

(i) v ^ e or x ^ e; 

(ii) \vwx\ < k; 

(iii) uv l wx % y G L, for all i >0. 

It is an immediate consequence of the Pumping Lemma that a group whose 
word problem is context-free admits a Dchn's Algorithm solution to its word 
problem. Thus such groups are word hyperbolic. The work of Muller and Schupp 
shows that in fact they are virtually free. 

Let L be a context-free language accepted by a pushdown automaton T over 
M c f x E*. We will prove the Pumping Lemma by showing that a sufficiently long 
successful path in T must have one of two structures. Either for some s £ il* the 
path contains a loop with label (P s ,v) followed by a segment with label (l,w) 
followed by a loop with label (Q s ,x); or the path contains a loop with label 
(1, v). The latter case corresponds to the possibility w — x = e. 

Since M c f is generated by {Pd, Qd \ d G 51} x (S U {e}), we may assume that 
r is defined over this set. Our proof depends on an analysis of the word problem 
of M c f with respect to the generating set {Pd, Qd \ d G fi}. 
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Lemma 7.7. Let A\ . . . A n be a nonempty word in the generators {Pd, Qd 
d € £1} representing 1 <E M c f. One of the following conditions holds. 

(i) A\ = Pd, A n = Qd for some d E 0, and either n — 2 or n > 2 and 
A 2 ■ ■ ■ A n -i represents 1; or 

(ii) For some i with 1 < i < n, A\ ■ ■ ■ Ai and Ai + \ ■ ■ ■ A n both represent 1. 
Further either n = 2 or A\ . . . A n has a proper subword of length at least n/2 
which represents 1. 

Proof. It follows from Lemma 7.1 that A\ . . . A n contains a subword PdQd- 
With this observation the first part of the following lemma is easily proved by 
induction on n. The first part implies the last part. □ 

Now let c be the number of vertices of T and choose a positive integer contant 
k > 8c. We permit ourselve to increase k as we proceed. Consider a word zei 
of length at least k, and pick a successful path 7 in T of minimal length with 
label (1, z). It follows from Lemma 7.7 that 7 has a subpath 70 of length between 
k and k/2 and with label (1, zq) for some subword z of z. We will analyze this 
subpath. 

First suppose 70 has a sequence of c consecutive nonempty subpaths 71 , . . . ,-f c 
with labels (1, Zi) for 1 < i < c. Let path ji extend from vertex p t ^i to vertex 
Pi. For some i,j with < i < j < c, the vertices pi and pj must be equal. 
Consequently 7 has a nonempty loop of length at most k with label (l,v) for 
some v e S*. If we alter 7 by traversing the loop i > times, we still have a 
successful path. In particular v ^ e lest there be a shorter successful path than 
7 with label (1, z). Thus we have z — uvy, v ^ e, \v\ < k and uv l y G £ for all 
i > 0. 

Now assume 70 does not have a sequence of c consecutive subpaths as above. 
Write 70 as a sequence of as many nonempty subpaths as possible such that 
each one has label (l,z') for some z' £ £*. Since there are fewer than c of 
these subpaths, one of them, call it 7q, has length at least fc/(2c). Further 
7p cannot itself be decomposed as a product of nonempty subpaths with labels 
(1, z 1 ). By Lemma 7.7 the sequence of lefthand components of edge labels of 70 
must have the form PdP\ ■ ■ ■ B m Qd for some nonempty word B>\ . . . B m in the 
generators {Pd,Qd \ d e 0} representing 1 e M c f. The proper subpath 71 of 
7q corresponding to B\ . . . B m has label (l,<2i) and by assumption cannot be 
decomposed as a sequence of c consecutive subpaths satisfying the conditions of 
the last paragraph. 

We can repeat the argument of the last paragraph with 71 in place of 70 to 
obtain 72. It is clear that if k is large enough, we can find paths 70, 71, . . . , j c 2 
each a proper subpath of the preceeding path and each with label (1, z') for some 
z' E E*. Since there are only c 2 pairs of vertices from T, two of these paths must 
begin at the same vertex and end at the same vertex. It follows that for some 
words r,s,r's' £ f2*, 7 has a subpath consisting of a loop with label (Q r P s ,v) 
followed by a segment with label (1, w) followed by a loop with label (Q r 'P s ' , 
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Further at least one of the loops is nonempty and Q r P s Qr'Ps' = 1- The latter 
condition implies the first loop has label P s and the second Q s . Consequently 
if we alter 7 by traversing each loop i times instead of one, we we still have 
a successful path. In particular vx ^ e lest there be a shorter successful path 
than 7 with label (l,z). Thus we have z = uvwxy, vx ^ e, \vwx\ < k and 
uv l wx l y G L for all i>0. 

Our argument has shown that if k is sufficiently large, all the conditions of 
the pumping lemma hold. 



8. Stack Automata 

In this section we briefly consider stack automata. A standard stack automa- 
ton is a pushdown automaton which can read but not alter the contents of its 
stack below the top. Figure 7 shows a stack automaton reading an element from 
the interior of its stack. The stack is shown in two pieces in order to suggest 
how two words may be used to represent both the contents of the stack and the 
stack location being read by the automaton. 



a\ ai 03 04 (Z5 qq a-j 



q 


di 


d 2 


d 3 




t 




d 5 


C?4 



Figure 7. A standard stack automaton. 



The appropriate monoid for a formal definition of a stack automaton is built 
out of the monoid M c f of partial functions P w and Q w defined in the preceding 
section. Since we will need to know when a stack is empty, we add a new partial 
function 



uE = e if u = e 



and define M\ to be the submonoid of partial functions generated by M c f U {E}. 



Lemma 8.1. Every element of Mi exceptO can be written as Q W P V orQ w EP l 
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A product of two such terms equals except in the following cases. 



Q wPzxQ xPy 


QwPzy 


Q w PyQ zv Py 


QzwPy 


Qw-^PzxQxPy 


QwPsPzy 


Q w Py Q z v P Py 


Q zwEPy 


QvjP Px QxP Py 


QvjPsPy 



Consider the monoid Mi x Mi acting from the right on ft* x ft*. We think of 
{w,v) S ft* as representing a stack which contains wv r , where v r is v read from 
right to left. In Figure 7 w — di<fadz, and v = . . .d 5 d 4 . The leftmost symbol of 
w is at the top of the stack, and the automaton is reading the rightmost symbol 
of v inside the stack. The operations of moving up and down in the stack are 
(Qx,P x ) and (P X ,Q X ) respectively. Ordinary stack operations are permitted 
only at the top of the stack, that is when w = e. They are (E, P x ) and (E, Q y ). 
M sa is the submonoid of Mi x Mi generated by all these elements. 

Definition 8.2. T sa = T(M sa ,{(E,\)}) is the family of stack-automaton 
languages. 

In other words the stack-automaton languages are the images of (E, 1) under 
rational relations p : M sa — > £*. 

Definition 8.3. A stack automaton is a finite automaton over M sa x E* 
where S* may be any finitely generated free monoid. If p : M sa — > S* is the 
rational relation accepted by a stack automaton, we say that the automaton 
accepts the stack-automaton language (E, l)p. 



(E,P d7 a) {P d ,Qd,b) {Qd,Pd,c) {E,Q d ,e) 




Figure 8. A stack automaton S accepting {a n b n c n }. 

Figure 8 shows a stack automaton S which accepts the language {a n b n c n }. S 
works by 

(i) Pushing an end of stack marker onto its stack. 

(ii) Pushing one d onto its stack for each a; 
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(iii) Matching d's on its stack against successive 6's until it reaches the bot- 
tom of the stack; 

(iv) Matching d's on its stack against successive c's until it reaches the top 
of the stack; 

(v) Emptying the stack. 

Lemma 8.4. The automaton S in Figure 8 accepts {a n b n c n }. 
PROOF. Any successful path has label 

(E l+1 P^P e Q e Q k d E m+ \ PeW d QePeP k d Q7Qe, oW, ) = 
{EP>Q$E, P e P l d Q 3 d QePeP^Q7Qe, aW). 

The middle entry is E if and only if j = k, and last entry reduces to 1 exactly 
when i = j and k = m. □ 

9. Grammars 

In this section we briefly discuss how grammars are used to generate lan- 
guages. We will show that the languages we have called context-free are exactly 
the languages generated by context-free grammars and thus coincide with the 
standard context-free languages. 

Definition 9.1. A grammar consists of 

(i) A finite set of terminal symbols S; 

(ii) A disjoint finite set of nonterminals A; 

(iii) A start symbol S G A; 

(iv) A finite set of productions V = {{a,f})} c(SU A)* x (E U A)* such 
that each a contains at least one symbol from A. 

Definition 9.1 is the usual one, but many variations exist. (EUA)* is called the 
set of sentential forms. Each grammar determines a binary relation on sentential 
forms. This relation is defined in stages. First we say that a sentential form 7 
directly derives another sentential form 7' with respect to a given grammar if 7' is 
obtained by substituting the righthand side of some production for the lefthand 
side in 7. The reflexive transitive closure of direct derivation is called derivation. 
We write 7^7' for direct derivation and 71 — ► 72 for derivation. The language 
generated by a grammar is the set of words over the terminal alphabet derivable 
from the start symbol. From now on we will write a — > (3 for productions instead 
of (a,0). 

Definition 9.2. A grammar is regular if its productions are of the form A — ► 
aB or A — > a where A and B are nonterminals and a is a terminal. 

Regular grammars are just another way of looking at finite automata over free 
monoids, and they generate exactly the regular languages. Indeed suppose that 
r is an automaton over S accepting L C S*. Assume its vertices are p,q,.. ., 
and label them with distinct symbols P,Q, . . . not in S. Let Pq be the label of 
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the initial vertex po. To each edge p — ► q associate the production P — ► aQ. If 
q is a terminal state, add the production P — > a. A successful path 



P0-»Pl^--- ~> Pm-l^Pr, 



to a terminal vertex p m is essentially the same as a derivation 

Pq — * &1-P2 — * a1.a2.P2 Ol .. . «m-lfm-l — > Ol . . . a 



Thus the grammar consisting of 

(i) Terminal alphabet S; 

(ii) Nonterminal alphabet {P, Q, . . .}; 

(iii) Start symbol Po; 

(iv) Productions {P — > aQ} U {P — > a} as above 

generates L. This argument can be reversed to show that any regular grammar 
can be converted to a finite automaton accepting the language generated by 
the grammar. If the grammar has a production P — > a but no corresponding 
production P — ► aQ, one can chose a new nonterminal Q and add the missing 
production. Since there are no productions with Q on the lefthand side, the 
language generated is not changed. 

Now we proceed to context-free grammars. 

Definition 9.3. A grammar is context-free if the lefthand side of every pro- 
duction consists of a single nonterminal. 

Let F be a free group of rank two and W(F) the word problem of F with 
respect to a choice of generators S = {a, a" 1 , 6, 6 -1 }, X* — > F such that a 
and b generate F freely. We will show that W(F) is generated by a context- 
free grammar. It follows from this result and Theorem 6.4 that every finitely 
generated group with a free subgroup of finite index has a context-free word 
problem. 

The key to proving that W(F) is context-free is its recursive nature. Any 
nonempty word which projects to the identity is either a product of shorter words 
which project to the identity, or has one of the forms ai>a _1 , bvb -1 , a~ 1 va, b~ 1 vb 
for some v € W(F). Accordingly we let Q be the grammar with productions 

S — > e, S — > SS, S — > aSa~ , S — > bSb~ , S — > a^Sa, S^>b~ 1 Sb. 

A sample derivation S — > abb~ 1 b~ 1 ba~ 1 with respect to £/ is given by 

To show that ^ is indeed a grammar generating IV(P) there are two steps, both 
easy. First prove by induction on derivation length that if S A w, then w = 1. 
Second use the observations on W(.F) made above to prove by induction on word 
length that if w £ W(F), then there is a derivation S —* w. 

Our next goal is to show that Definition 7.2 defines precisely the class of 
languages generated by context-free grammars. 
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Lemma 9.4. If L is generated by a context-free grammar, then L is gener- 
ated by a context-free grammar in which the righthand side of every production 
consists entirely of nonterminals or is a single terminal or is e. 

Proof. For each terminal a add a new variable A and a new production 
A — > a to the grammar. Replace all occurrences of terminals in the original 
productions by the corresponding new variables. It is straightforward to check 
that L remains unchanged. □ 

Lemma 9.5. If there is a derivation S A w with respect to a context-free 
grammar Q, then there is a derivation in which productions are always applied 
to the leftmost nonterminal. 

Proof. We show by induction on derivation length that for all nonterminals 
A and derivations A A w of words in the terminal alphabet there is a leftmost 
derivation. Clearly any derivation of length one is leftmost for it must consist 
of the single step A — > w. The first step of any longer derivation A A w must 
be A — ► wqAiWi ■ ■ ■ A n w n where the Ai's are nonterminals and the Wi's are 
words in the terminal alphabet. The remaining steps in the derivation amount 
to derivations Ai A m such that w — wqU\ ■ ■ ■ u n w n . By induction on derivation 
length, these derivations may be taken to be leftmost from which it is clear that 
there is a leftmost derivation A A w. □ 

Theorem 9.6. A formal language is generated by a context-free grammar if 
and only if it is accepted by a pushdown automaton. 

Proof. First suppose that L C £* is generated by a context-free grammar 
Q; we will use Q to construct a pushdown automaton accepting L. There is no 
loss of generality in assuming that Q satisfies the condition of Lemma 9.4 and 
that its nonterminals are in the set Q introduced at the beginning of Section 7. 
For any sentential form a, a r stands for the reverse of a, i.e., a read from right 
to left. Define T to be a pushdown automaton with an initial state, qo, and a 
terminal state qt- Add an edge from qt to qt with label 

(i) {QaP<x t , e) for each production A — > a in Q with a consisting of nonter- 
minals; 

(ii) {Qa, a) for each production A — > a with a G E U {e}. 

Add one more edge from q to q t with label (Pg, e) where S is the start symbol 
for the grammar. Prove by induction on path length that if there is a path 
from qo to q t with label (P 7 r,w) then there is a leftmost derivation S A wj; 
and prove the converse by induction on derivation length. In particular there 
is a successful path with label (P € , w) = (1, to) if and only if there is a leftmost 
derivation S — > w. It follows from Lemma 9.5 that L is the language accepted 

by r. 

Now suppose L C E* is a context-free language. By Theorem 2.6 L is accepted 
by a pushdown automaton T over {Pd, Qd \ d G 0} x (E U {e}). Let po be the 
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initial vertex of T, and for each pair of vertices p, q define L p . q to be the language 
accepted by T with its start state changed to p and with single terminal state q. 
We will construct context-free grammars Q p ^ q for these languages; the grammars 
will be identical except for their start symbols. As L is the union over all terminal 
vertices q of the languages L po _ q , we obtain a grammar for L by adding a new 
start symbol S and productions S — + A POiq to the union of the grammars G P , q 
over all terminal vertices q. 

To define Q p . q start with terminals S and one nonterminal A VA for each pair 
of vertices p, q of T such that there is a path in T from p to q with label (1, w) 
for some w G £*. In particular there are nonterminals A p/p for all vertices p of 
r. Add 

(i) Productions A p , q — > A p , T A rA for all triples p, r, q such that the corre- 
sponding nonterminals exist; 

(ii) Productions A p , q — ► aA 1%s b for all nonterminals A P:q such that there is 
a nonterminal A r , s , an edge labeled (a, Pd) from p to r, and one labeled 
(b, Q d ) from s to q; 

(iii) Productions A PtP — > e for all vertices p. 

A straightforward argument using Lemma 7.7 and induction on path length 
shows that if (1, w) is the label of a path fromp to q in T, then there is a derivation 
A VA ^> w. It is also straightforward to show by induction on derivation length 
that A p-q — > w implies that (l,w) is the label of a path from p to q in T. It 
follows from these results that the grammar G PtQ with start symbol A Ptq and 
other components as above generates L p ^ q . □ 

10. Literature 

The first paragraph of the introduction mentions several applications of formal 
language theory to group theory; here are the corresponding citations. Auto- 
matic groups and word hyperbolic groups were introduced by Epstein, Cannon, 
Holt, Levy, Paterson and Thurston [7], and Gromov [14] respectively. Sims' 
book [24] contains many applications of rational languages to computational 
group theory, and in Chapter 4 there is an automata-theoretic analysis of finitely 
generated subgroups of free products of cyclic groups. Muller and Schupp [21] 
showed that a group has context-free word problem if and only if its word problem 
is virtually free. For a survey of further developments along these lines including 
the characterization of virtually cyclic groups by one-counter languages see the 
survey by Herbst and Thomas [16]. Bridson and Gilman [5] have character- 
ized fundamental groups of geometrizablc three-manifolds in terms of indexed 
languages. 

One of the earliest investigations of word problems of groups in terms of formal 
languages is by Anisimov [1]. Theorem 4.2 is usually credited to Anisimov and 
Seifert [2] ; a special case which extends immediately to the general result is given 
in Chapter 6 of Johansen's dissertation [18]. 
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There is another concept of automatic group which predates the one referred 
to above, namely a group of invertible rational transductions of a free monoid. 
Recall that by Corollary 5.4 rational transductions are closed under composition. 
Automatic groups in this sense with intermediate growth have been discovered 
by Grigorchuk [13]. We note that the class of groups whose elements are ra- 
tional transductions permuting a fixed rational language includes both types of 
automatic group. 

The technique used in this article of defining different kinds of automata 
in terms of rational subsets of monoids is well-known and probably grew out 
of the observation that popping an element off a stack is a right inverse to 
pushing it on. Similar tacks are taken by Brainerd and Landweber [3, Chapter 4] , 
Eilenberg [9, Volume A, Chapter X], Floyd and Biegel [10], Goldstine [12], 
and Salomaa, Wood and Yu [25]. Most of these cited works include diagrams 
resembling Figure 2. For standard presentations of language theory one can 
consult Harrison [15] or Hopcroft and Ullman [17]. Revesz [23] has a slightly 
different perspective, and Conway [6] gives an extensive treatment of regular 
languages. 

Although we have not mentioned them here, there are many applications of 
language theory to semigroups and monoids. See Lallement [19], Book and 
Otto [4], Madlener and Otto [20], as well as Eilenberg's two volumes. 
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